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ABSTRACT 

We present new results for the 3-point correlation function, £, measured as a function 

■ of scale, luminosity and colour from the final version of the two-degree field galaxy 
t^J- \ redshift survey (2dFGRS). The reduced three point correlation function, Q3 ~ C/S, 2 , 

■ is estimated for different triangle shapes and sizes, employing a full covariance anal- 
^3 ■ ysis. The form of Q3 is consistent with the expectations for the A-cold dark matter 

\ model, confirming that the primary influence shaping the distribution of galaxies is 

•■/"") ■ gravitational instability acting on Gaussian primordial fluctuations. However, we find 

a clear offset in amplitude between Q3 for galaxies and the predictions for the dark 
matter. We are able to rule out the scenario in which galaxies are unbiased tracers of 
O the mass at the 9- a level. On weakly non-linear scales, we can interpret our results 

■ I in terms of galaxy bias parameters. We find a linear bias term that is consistent with 

unity, bi — 0.93^;^ and a quadratic bias C2 = &2/&1 = — 0.34T.g;Jg. This is the first 
significant detection of a non-zero quadratic bias, indicating a small but important 
non-gravitational contribution to the three point function. Our estimate of the linear 
bias from the three point function is independent of the normalisation of underlying 
density fluctuations, so we can combine this with the measurement of the power spec- 
trum of 2dFGRS galaxies to constrain the amplitude of matter fluctuations. We find 



that the rms linear theory variance in spheres of radius 8h 1 Mpc is as = 0.88^q 



+0.12 

10' 



X 

providing an independent confirmation of values derived from other techniques. On 
non- linear scales, where £ > 1, we find that Q3 has a strong dependence on scale, 
colour and luminosity. 

Key words: galaxies: statistics, cosmology: theory, large-scale structure. 



1 INTRODUCTION 

The higher order statistics of galaxy clustering encode fun- 
damental information about two key dynamical aspects of 
the large scale structure of the Universe: the growth mecha- 
nism of fluctuations and the connection between the galaxy 
distribution and the underlying mass (for a review, see 
Bernardeau et al. 2002). An accurate measurement of the 
three-point correlation function of galaxies has the poten- 
tial to test the gravitational instability paradigm of struc- 
ture formation and, on scales that are evolving in the weakly 
non-linear regime, to separate the effects of gravity from the 
contributions arising from galaxy bias (Fry & Gaztanaga 
1993; Frieman & Gaztanaga 1994). 

The measurement of the three-point function and other 
higher order statistics from galaxy catalogues has a rich 
history (Peebles & Groth 1975; Groth & Peebles 1977; 



Fry & Peebles 1978; Baumgart & Fry 1991; Gaztanaga 
1992; Bouchet et al. 1993; Fry & Gaztanaga 1994). In the 
past decade, three-point statistics have supported the ba- 
sic premise of gravitational instability from Gaussian initial 
conditions (Frieman & Gaztanaga 1994; Jing & Borner 1998; 
Frieman & Gaztanaga 1999; Hoyle, Szapudi & Baugh 2000; 
Feldman et al. 2001). The impact of these measurements 
on theoretical models has, however, not been as great as it 
could have been for two reasons. First, the traditional theo- 
retical predictions rely upon the application of perturbation 
theory, which limits the comparison with data to relatively 
large scales on which the fluctuations are evolving in a linear 
or weakly non-linear fashion. Second, previous generations 
of galaxy surveys simply covered too little volume to permit 
accurate measurements of the higher order correlation func- 



2 Gaztanaga et al. 



tions on the scales that could strongly constrain the simple 
theoretical models. 

Recent theoretical and observational advancess have 
been such that we are now in a position to realize the full 
potential of higher order statistics. Theoretical models of 
galaxy formation have progressed sufficiently to make pre- 
dictions for the number of galaxies that reside in dark mat- 
ter haloes of different mass (Benson et al. 2000; Peacock & 
Smith 2000; Scoccimarro et al. 200fb; Berlind et al. 2004). 
This allows the prediction to be extended to scales for which 
perturbation theory is not valid, and provides a framework 
for testing the physics of galaxy formation directly against 
clustering measurements. Observationally, two recent sur- 
veys have revolutionized our view of the local Universe: 
the two-degree field galaxy redshift survey (hereafter 2dF- 
GRS; Colless et al. 2001) and the Sloan Digital Sky Survey 
(SDSS; York et al. 2000). The ten-fold increase in survey size 
achieved by these projects means that precision measure- 
ments of higher order statistics are now be possible across a 
range of scales (Matarrese, Verde & Heavens 1997; Colombi, 
Szapudi & Szalay 1998; Szapudi, Colombi & Bernardeau 
1999; Scoccimarro, Sefusatti & Zaldarriaga 2004; Sefusatti 
& Scoccimarro 2005). The higher order clustering measure- 
ments that are possible with these surveys have the potential 
to tighten the accepted values of basic cosmological param- 
eters and to constrain the physics of galaxy formation that 
govern how galaxies are clustered. 

There have have been several analyses of the distribu- 
tion of counts- in- cells using the final 2dFGRS catalogue. 
Baugh et al. (2004) demonstrated that the higher order 
correlation functions display a hierarchical scaling, S p = 
i P /§2~ i wnere £,p is the p-point, volume averaged corre- 
lation function; this behaviour is expected if gravity plays a 
dominant role in shaping the distribution of galaxies. Cro- 
ton et al. (2004b) found that the scaling of the hierarchical 
co-efficients show a weak (if any) dependence on galaxy lu- 
minosity. In the case of the three point volume averaged 
correlation function, both authors found that the skewness, 
•S"3 = Cs/Cl - 2. This value was found to be independent of 
cell size, though both Baugh et al. and Croton et al. noted 
that two large superstructures in the 2dFGRS volume broke 
this scale invariance in catalogues characterised by L* galax- 
ies. The result for the skewness of galaxies, S3 ~ 2, is at 
odds with the expectation for a ACDM cosmology, in which 
Sf ~ 3. We note that Conway et al. (2005) and Wild et al. 
(2005) have also looked at the constraints that the distri- 
bution of counts-in-cells in the 2dFGRS provide on galaxy 
bias. All results from the 2dFGRS are in line with most 
previous measurements of the skewness and 3-point statis- 
tics, which are generally lower than the ACDM predictions 
(for a review, see §8 in Bernardeau et al. 2002) . This posses 
a puzzle, because the corresponding measurements for the 
variance and the 2-point function seem to follow the unbi- 
ased ACDM predictions closely on large scales. 

In this paper, we present the first general results for 
the three-point correlation function measured from the fi- 
nal 2dFGRS catalogue. Preliminary measurements of three- 
point statistics were made using early releases of the 2dF- 
GRS and SDSS datasets by Verde et al. (2002), Jing & 
Borner (2004), Wang et al. (2004) and Kayo et al. (2004). 
Pan & Szapudi (2005) measured the monopole moment of 
the three point function in the completed 2dFGRS. Our 
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Figure 1. 3-points define a triangle, which is characterized here 
by the two sides 7*12 and ri3 and the interior angle a. 



analysis has the advantage over that of Pan & Szapudi in 
that it includes information about the shapes of the triangles 
of galaxies. A further improvement over previous approaches 
is a proper treatment of the correlation between data points. 
We follow the methodology introduced by Gaztanaga & 
Scoccimarro (2005, hereafter GS05) to obtain constraints 
on bias parameters from measurements of the reduced three- 
point correlation function. 

This paper is organized as follows. In Section |5J we re- 
view some basic definitions involving the three-point func- 
tion, as well as the methodology used for its estimation. In 
Section [3] we present the 2dFGRS catalogues and the asso- 
ciated mocks. Our results are presented in Section [I] This is 
quite a long section that is divided into many subsections; 
a detailed route map is provided at the start of this section. 
Our results are compared with previous analyses of the 2dF- 
GRS in Section 5. Finally, our conclusions are presented in 
Section 6. 



2 THEORETICAL BACKGROUND 

In this section, we first give some basic definitions (§2.1), 
before discussing the expected form of the three point cor- 
relation function (§2.2). We then explain how our results can 
be related to the predictions for the three point function of 
dark matter (§2.3). For a comprehensive discussion of this 
material, we refer the reader to the review by Bernardeau 
et al. (2002). The method for estimating the three point 
function for the 2dFGRS is set out in §|21] Finally, in 
we give an outline of how our measurement of the three 
point correlation function can be used to place constraints 
on models of bias (for a complete discussion see GS05). 



2.1 Basic definitions: triangle shape and scale 

GS05 discuss the merits of various conventions for defining 
triangle shapes and scales. We adopt their preferred scheme 
in which a triangle is defined by the ratio of the lengths 
of two of the sides of the triangle, f '12/^23 and the angle 
between them, a: 

cos(a) = . . (1) 

r\2 r 2 3 

The angle a can vary between — 180 degrees; for a = 0, 
the third side of the triangle is given by T3i = ri2 — ^23 and 
for a = 180 degrees, rsi = ri2 + r23 (Fig. [I). 
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2.2 The three point correlation function 

The connected two and three point correlation functions are 
defined as: 



£(ria) = < S(n)S(r 2 ) > 
C(ri2,r23,ns) = < <5(ri)(5(r 2 )5(r 3 ) > 



(2) 
(3) 



where S(r) = p(r)/p — 1 is the local density fluctuation 
around the mean p —< p > and the expectation value is 
taken over different realizations of the model or physical 
process. In practice, the expectation value is constructed by 
averaging over different spatial locations in the Universe, 
which are assumed to form a fair sample (Peebles 1980). 

The two and three point correlation functions change 
rapidly in amplitude as a function of separation. In order 
to study the relationship between the correlation functions 
in more detail, it is useful to define the reduced three point 
function, Q 3 , (Groth & Peebles 1977): 



Q 3 



C(ri2,r 2 3, ri 3 ) 



(4) 



C,H(ri2,r 2 3,ri3) 
(h = C(ri 2 )$(r23)+$(r-i2)f(ri3) + $(r 2 3)$(ri3). (5) 

Here we have introduced a 'hierarchical' form for the three 
point function, This quantity is built up from the prod- 
ucts of two-point functions generated from cyclic permuta- 
tions of the pair separations which make up the sides of the 
triangle. When Q 3 is constant the dependence of the three 
point correlation function on triangle shape and scale is fully 
accounted for by the corresponding changes in (h', in this 
case Q3 is said to have no configuration dependence. 

Previously, Q3 was thought to be approximately con- 
stant as a function of triangle size or shape (ie see Groth & 
Peebles 1977), a phenomenon that is usually referred to as 
hierarchical scaling ( Peebles 1980 1 . However, GS05 showed 
that with sufficiently accurate theoretical predictions or for 
carefully constructed measurements, Q3 is not in fact con- 
stant in any clustering regime. Nevertheless, the variation in 
Q3 with scale is small when compared to the corresponding 
changes in £ or On small scales (< 10 ft" 1 Mpc), and for 
galaxies in redshift space (i.e. as measured by galaxy redshift 
surveys), GS05 showed that Q 3 (q) displays a characteris- 
tic U-shape anisotropy moving from collapsed or elongated 
(a ~ 0, 180) to more open (a ~ 90) triangles. This effect 
is driven by the velocity dispersion of galaxies inside viri- 
alized structures. GS05 demonstrated that this U-shape is 
universal, being only very weakly dependent on scale, the 
primordial spectral index, or the values of the cosmologi- 
cal parameters. GS05 further demonstrated that this feature 
should be detectable in current galaxy surveys, even if the 
measurements are affected by shot-noise or if galaxies are 
biased tracers of the mass. On larger scales, the impact of 
velocity dispersion on the form of Q3 is reduced, with the 
consequence that the U-shape tends towards more of a V- 
shape and approaches the (real space) perturbation theory 
prediction (see fig. 2 in GS05). 

2.3 Theoretical interpretation 

In order to interpret our measurements of Q3 for galaxies, we 
will compare them with theoretical predictions for dark mat- 
ter in a ACDM universe. We first explain the form expected 
for the three point function of dark matter (§2.3.1), before 



introducing a notation to quantify the differences found be- 
tween galaxy and dark matter Q3 measurements (§2.3.2). 



2. 3. 1 The three point function of dark matter 

We shall denote the value of Q3 for the dark matter by Q™. 
The theoretical predictions for Q™ are relatively insensitive 
to the precise values of the cosmological parameters, but 
have a strong dependence on the local spectral index, n, of 
the linear perturbation theory power spectrum, P(k), where 
n = d log P/d log k (eg. see fig. 9 and fig. 10 in Bernardeau 
et al. 2002). This is also the case in redshift space (see fig. 4 
of GS05), but where the dependence is however weaker. On 
the scales of interest to the present work, a change in the 
local spectral index of An translates roughly to a change 
in the mean amplitude of Q 3 M by AQ 3 ~ AS3/3 ~ An/3 
(see Juszkiewicz, Bouchet & Colombi 1993). As an illus- 
tration, the difference in the shape of the power spectrum 
between CDM models with density parameters of Q m = 0.7 
and Q. m — 0.2 is approximately An ~ 0.6 on the scales of 
interest here, and so the change in Q™ between these mod- 
els is small, AQ 3 ~ 0.2 (in good agreement with fig. 4 of 
GS05). The relative insensitivity of Q™ to changes in the 
CDM power spectrum is important as it strengthens any 
conclusions we reach about differences between the value of 
Q 3 measured for galaxies and the predictions for the dark 
matter. The current levels of uncertainty on the matter den- 
sity parameter, fi m and the primordial spectral index n s are 
around the 10% level or better, and so the predicted value of 
Q-f is tightly constrained (e.g. Percival et al. 2002; Tegmark 
et al. 2004; Sanchez et al. 2005, in prep.). 

It is also possible to use an empirical approach to esti- 
mate Q™, without appealing directly to the ACDM model. 
If we assume that the two-point function of galaxy clus- 
tering has the same shape as that of the underlying mass, 
then the measured correlation function or power spectrum of 
galaxies can be used to infer the spectral index of the dark 
matter. The two-point correlation function and the power 
spectrum of galaxy clustering have both been measured for 
the 2dFGRS on large scales (Percival et al. 2001; Hawkins 
et al. 2003; Cole et al. 2005). It turns out that the shapes 
of these clustering statistics are compatible with the predic- 
tions of the ACDM model. The uncertainties in n are small 
(An < 0.1) compared to the sampling errors in the measure- 
ments of Q 3 for the 2dFGRS (GS05). We will therefore as- 
sume the concordance ACDM model, specified by Q m — 0.3, 
Aa — 0.7 and h ~ 0.7, to generate predictions for Q™, and 
neglect the impact of any uncertainty in these parameters. 



2.3.2 Comparing the three point functions of galaxies and 
dark matter: the implications for bias 

The Q3 value measured for galaxies may be different from 
the theoretical predictions for the dark matter, Q™. We 
adopt a particularly simple scheme to quantify any such dif- 
ferences: 



(6) 



Two numbers specify the difference between the measured 
and predicted Q3: a shift or offset, C, and a scaling, B. The 
simple ansatz given in Eq.|S|is general and can, in principle, 
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be applied on any scale. However, the interpretation of the 
numbers B and C does depend upon the scale under con- 
sideration. Furthermore, we should caution that this model 
may not necessarily always provide a good description of the 
transition from the clustering of the dark matter to galaxies. 

The form we have chosen is motivated by perturbation 
theory, which applies on scales for which the correlations 
are small, i.e. £ < 1. Fry & Gaztanaga (1993) modelled 
fluctuations in the density of galaxies, So, as a local, non- 
linear expansion of fluctuations in the mass distribution, 5: 

S G = F[S]^ b i sk - (7) 

k 

This formalism can be used to derive a relation between 
the three point function of galaxies and mass (see Fry 
& Gaztanaga 1993; Frieman & Gaztanaga 1994). On the 
weakly non-linear scales for which this transformation is a 
reasonable approximation, B ~ bi and C ~ C2 = 62/61: 

Q 3 ^ (Q D 3 M + c 2 ). (8) 

In this case, the shift by C can be interpreted as a non- 
gravitational contribution to Qjf and B is a simple linear 
bias scaling. These effects can become degenerate if Q3 is 
approximately constant or when the measurement errors be- 
come large. Nevertheless, it is possible, in principle, to com- 
pare the shape of Q3 measured for galaxies to that predicted 
for the dark matter, and so constrain B and C separately. 
Norberg et al. (2005; Paper I, in prep.) use the two point cor- 
relation function to obtain a working definition of the scale 
marking the approximate boundary between the non-linear 
and weakly non-linear regimes; they propose that weakly 
non- linear scales < 1) correspond to pair separations 
of > 9 ft -1 Mpc, whereas the non-linear regime (£ > 1) is 
reached when r < 6 h" 1 Mpc. 

From its definition in Eq. 21 Q3 is independent of the 
amplitude of fluctuations on large scales. We can therefore 
use the value of B to constrain the amplitude of fluctuations 
in the dark matter. In this approach, we take the two-point 
correlation function measured for galaxies and divide this 
by B 2 to obtain an empherical two-point function estimate 
of the dark matter. Then, after measuring the actual shape 
of the two-point function of the dark matter distribution 
from simulations, we can constrain the rms linear variance 
in spheres of radius 8/i _1 Mpc, as by equating our empiri- 
cal dark matter estimate to the actual value. This method 
for constraining erg relies upon several approximations and 
assumptions that we have tested successfully using N-body 
simulations (see Norberg et al. 2005, Paper I, in prep, for 
a full description of the method). Similar approaches have 
already been attempted using the skewness of the distribu- 
tion of galaxy counts-in-cells, 53 (Fry & Gaztanaga 1993; 
Gaztanaga 1994; Gaztanaga & Frieman 1994), the bispec- 
trum (eg Frieman & Gaztanaga 1994; Fry 1994; Scoccimarro 
1998; Verde et al. 2002) and the angular 3-point function 
(Frieman & Gaztanaga 1999). 

2.4 The estimation of the three point function 

To estimate the three point correlation function efficiently 
for the 2dFGRS, we use the fast grid based algorithm in- 
troduced by Barriga & Gaztanaga (2002). GS05 presented 



further tests of this algorithm using a wide range of numeri- 
cal simulations and mock catalogues. These authors demon- 
strated that special attention should be paid to the grid 
dimension employed in order to obtain robust estimates of 
the three point function in redshift space. For practical rea- 
sons, we use a somewhat lower than ideal pixel resolution in 
the estimation of the three point function from the 2dFGRS. 
This results in some smoothing of the U-shape in Qz{a) for 
collapsed configurations (compare fig. 5 of GS05 with our 
Fig. 5). As we use the same pixelization in the analysis of 
the mocks and dark matter theory, this loss of resolution 
does not affect our conclusions, although it could result in 
slightly less than optimal constraints on B and C. 

The final 2dFGRS catalogue contains some incomplete- 
ness which is quantified by the spectroscopic completeness 
mask (Norberg et al. 2002b). The spectroscopic complete- 
ness of the final 2dFGRS is much more uniform than that 
of the 100K release or the samples used in earlier cluster- 
ing analyses by the 2dFGRS team (e.g. Verde et al. 2002), 
as shown by fig. 3 of Cole et al. (2005). We reject pixels 
on the sky for which the spectroscopic completeness is less 
than 50%. We account for the remaining incompleteness by 
applying a weight to the galaxy cell density. Further details 
about the 2dFGRS catalogue are given in Section 3.1. 



2.5 Constraining model parameters using Q3 

The values of Q3 measured for different opening angles are 
correlated. This needs to be taken into account when using 
measurements of Q 3 to place constraints on model parame- 
ters, such as the values of B and C defined by Eq. |S] GS05 
introduced an eigenmode approach to parameter fitting with 
Q3, and used this to demonstrate the level of the constraints 
on B and C that could be expected from the 2dFGRS. GS05 
estimated the covariance matrix for Q 3 (a) using the mock 
2dFGRS catalogues that we describe in Section 13.21 They 
then obtained the inverse of the covariance matrix using the 
Singular Value Decomposition method. In this approach, 
eigenmodes that fall below some specified signal-to-noise 
(S/N) threshold are discarded. The likelihood contours in 
the B — C plane are specified by S\ 2 computed using the 
eigenvectors above the S/N threshold. The S/N values that 
we estimate are not quite optimal, because we use a finite 
number of mock catalogues. Our errors are therefore conser- 
vative estimates. The S/N values indicate the significance 
of the measurement of Q3 (i.e. the number of standard de- 
viations that the signal is above the noise). However, the 
S/N ratio does not translate directly into the size of the 
likelihood contours in the B — C plane, because the degen- 
eracy between these parameters also depends on how far the 
measured Q 3 deviates from a constant as a function of an- 
gle. Even in the case of an infinite S/N, B and C will be 
degenerate if Q 3 is independent of angle (i.e. see Eq. |SJ. 



3 THE GALAXY CATALOGUES 

In this section, we describe the 2dFGRS data that we use 
to measure Q3 (§3.1), and the synthetic catalogues that are 
employed to perform our error analysis and make the ACDM 
predictions (§3.2). 
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3.1 The 2dFGRS data 

Our starting point is the final 2dFGRS catalogue (Colless 
et al. 2003; a full description of the construction of the sur- 
vey is given in Colless et al. 2001). The 2dFGRS consists 
of 221,414 unique, high quality galaxy redshifts, with a me- 
dian redshift of z ~ 0.11 to the nominal extinction cor- 
rected magnitude limit of &j w 19.45. Colour information 
is now available for the 2dFGRS through the addition of 
rF-band photometry (see Cole et al. 2005). In our analysis, 
we consider the two contiguous regions of the survey which 
lie towards the directions of the SGP and NGP, covering a 
solid angle of approximately 1200 square degrees. The red- 
shift completeness of the survey varies with position on the 
sky. Colless et al. (2001) and Norberg et al. (2002b) describe 
a strategy for dealing with this incompleteness in clustering 
studies. We restrict our attention to regions of the survey 
for which the spectroscopic completeness exceeds 50 %. We 
note that the typical completeness for the final survey is 
much higher than this (~ 85 %). 

We follow the approach adopted in several previous 
clustering analyses of the 2dFGRS and construct volume 
limited samples from the magnitude limited redshift cata- 
logue (Norberg et al. 2001; Norberg et al. 2002a; Baugh et al. 
2004; Croton et al. 2004a,b). This greatly simplifies the es- 
timation of the clustering signal, as then the only variations 
in number density across the galaxy sample will be due to 
the presence of large scale structure. A volume limited sam- 
ple is defined by an interval in absolute magnitude, and this 
translates into a minimum and maximum redshift. For each 
galaxy an absolute magnitude is computed using its redshift 
and apparent magnitude, and assuming the band-shift and 
evolutionary correction (k + e) advocated by Norberg et al. 
(2002b) (see also Cole et al. 2005) . In a volume limited sam- 
ple, each galaxy could in principle be displaced to any depth 
within the sample and would still remain within the appar- 
ent magnitude range of the survey. In this paper we consider 
the samples listed in Table Q which correspond to samples 
1-4 as listed in Table 1 of Croton et al. (2004b). As in Cole 
et al. (2005), we also split the samples by restframe bj-rF 
colour into blue, bj-rF<1.07, and red, bj-rF>1.07, subsam- 
ples. 

Baugh et al. (2004) and Croton et al. (2004b) both point 
out the impact of two superstructures, one in each of the 
NGP and SGP regions, on the estimation of the moments 
of the distribution of counts-in-cells from the 2dFGRS. The 
SGP structure is at a ~ 13hr and d ~ 240ft~ 1 Mpc and 
the NGP structure is at a ~ 0.5hr and d ~ 325/i _1 Mpc 
(see fig. 1 of Baugh et al. 2004). They found these overden- 
sities were particularly influentual on measurements made 
from the L* volume limited sample, i.e. for galaxies with 
—20 <Mbj — 51og 10 h< —19; fainter volume limited samples 
do not extend to the distance of the superstructures and 
brighter samples cover a larger volume and thus dilute the 
contribution of the structures. In this paper, we follow the 
approach of these authors and in Section 14.41 present mea- 
surements of the three point function made when masking 
out the regions containing these superstructures. It turns out 
that this exclusion removes only a small fraction of the to- 
tal L* volume, approximatly 2 %, along with the fewer than 
5 % of the total galaxies contained within it. This exercise 
is merely intended to be illustrative. We are not proposing 



Table 1. Properties of the combined 2dFGRS SGP and NGP 
volume-limited catalogues (VLCs). Columns 1 and 2 give the faint 
and bright absolute magnitudes that define the sample. Columns 
3 and 4 give the number of galaxies in each sample and the mean 
number density. Columns 5 and 6 state the minimum and max- 
imum comoving distances that bound each sample for the nom- 
inal apparent magnitude limits of the survey. All distances are 
comoving and are calculated assuming standard values for the 
cosmological parameters (Qq = 0.3 and Ha = 0.7). 



Mag. 
Mb., - 


range 
5 log 10 h 


N G 


Pave 
10~ 3 //i -3 Mpc 3 






-17.0 


-18.0 


8038 


10.9 


24.8 


169.9 


-18.0 


-19.0 


23290 


9.26 


39.0 


255.6 


-19.0 


-20.0 


44931 


5.64 


61.1 


375.6 


-20.0 


-21.0 


33997 


1.46 


95.1 


537.2 


-21.0 


-22.0 


6895 


0.11 


146.4 


747.9 



that the removal of these structures should be thought of as 
a correction to our measurements, but rather should serve 
as an indication of the magnitude of systematic effects in 
the estimation of higher order statistics from surveys of the 
size of the 2dFGRS. 



3.2 Mock catalogues 

Mock catalogues play an important role in our analysis. 
They are used to compute errors on our measurements and 
also as a means of generating the predictions of the ACDM 
model, taking into account the selection function of the 2dF- 
GRS. Following GS05, we construct the (normalized) covari- 
ance matrix for the measurements of the three point function 
using an ensemble of 22 synthetic 2dFGRS catalogues ex- 
tracted from the ACDM Hubble Volume N-body simulation 
(Evrard et al. 2002) . The construction of these catalogues is 
described by Norberg et al. (2002b). The mock catalogues 
have the same radial and angular selection function as the 
2dFGRS, and have been convolved with the completeness 
mask of the survey. A simple phenomenological prescription 
has been applied to the final density field in the simulation 
in order to extract points with a clustering amplitude that 
is a modulated version of the clustering of the underlying 
dark matter (Cole et al. 1998). We also use the dark matter 
from the Hubble Volume simulation to generate dark mat- 
ter predictions for the concordance ACDM model in redshift 
space. 

The mocks were not constructed to match higher order 
clustering statistics, as the biasing model used (see Norberg 
et al. 2002b; Cole et al. 2005) was tuned only to reproduce 
the 2-point correlation function of all galaxies (as measured 
for the 2dFGRS by Hawkins et al. 2003). In particular, this 
means that the mock galaxy catalogues do not display lu- 
minosity dependent clustering, as seen in the data (Norberg 
et al. 2001). This deficiency can be turned around to provide 
an interesting test of our analysis, since the mocks should al- 
ways give, for a fixed triangle configuration, the same mean 
(33(a) for different luminosities, regardless of the volume 
under consideration or the density of galaxies. 
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4 RESULTS FOR Q 3 

In this section we present our measurements of Q3 for dif- 
ferent 2dFGRS galaxy samples, defined by luminosity and 
colour. In the first two subsections, we consider general tri- 
angle shapes, focusing first on scales for which we expect 
the clustering to be in the weakly nonlinear regime ( H4.1H 
before considering the nonlinear regime f t|4.2H . The reason 
behind this split is that the interpretation of our measure- 
ments is quite different in these two cases, as discussed in 
section |2~31 In both sections, we consider how our measure- 
ments depend upon galaxy luminosity and, in the case of 
the nonlinear regime, on colour as well. In H4.3I we consider 
the special case of equilateral and elongated triangles, which 
give a cleaner measure of the physical scale dependence of 
Q3. Finally, in >14.4I we discuss the influence of large struc- 
tures on the measurement of the three point function in the 
2dFGRS. From now on, we use the shorthand notation M£ 
to denote M bj — 5 log 10 h. 

4.1 Q-i(ct) in the weakly non-linear regime 

The —21 <M{J J < —20 volume limited sub-sample yields the 
highest signal-to-noise measurements of Q3 in the weakly 
non-linear regime. Samples brighter than this have too low 
a galaxy density to permit robust measurements on such 
scales, while fainter samples span smaller volumes, resulting 
in a larger sampling variance. We will therefore describe 
the results for this sample in some detail, as this will serve 
to make several basic points that can be applied to other 
samples. 

4.1.1 Q 3 (a) for -21 <M^< -20 

In the top row of Fig. [5] we show Qz{a) for galaxies with 
—21 <M{J J < —20 and different triangle configurations: ri2 = 
9 and r i3 = 9, 18 and 27 h _1 Mpc, from left to right re- 
spectively. The middle panel illustrates the scatter expected 
in such a measurement of Qi(a), obtained using the mock 
galaxy catalogues. Here, the prediction for the ACDM dark 
matter model is shown by the solid line while the dashed 
line shows a biased version of this, computed by inserting 
B = 1 and C = —0.3 into Eq. |S| In the bottom panels, we 
show the likelihood contours for B and C derived from fit- 
ting the observed Qz[pt) to the ACDM prediction with the 
theoretically motivated relation given by Eq. [5] 

In the top panels of Fig. |3Jwe clearly see, for the data, 
the characteristic dependence of Q3 on a, with the V-shape 
becoming more pronounced as larger scales are considered. 
Note how the variation in the shape of Q3 with scale seen 
in the 2dFGRS data is mimicked by the dark matter predic- 
tions and by results for the mock catalogues. 

The middle panels of Fig. [5] show how closely Q3 esti- 
mated from the mock catalogues agrees with the measure- 
ments from the 2dFGRS. This agreement is all the more 
remarkable when one recalls that a match to Q3 was not re- 
quired in the construction of the mocks. Another noteworthy 
feature of the results for the mocks is the strong covariance 
that is apparent between the measurements of Q3 in different 
angular bins. Hence, to perform a meaningful fit to Qs(a), 
there is a clear need to decompose the measurement into 
statistically independent Q-eigenmodes, yielding a basis in 



Table 2. The best-fitting 61 and c 2 values for the -21 <M£ j < 
—20 sample for a range of weakly non-linear scales, along with the 
associated marginalized 68 % confidence intervals (corresponding 
to 1-cr if one assumes Gaussian statistics), as taken from Fig. HI 
Each entry in the table uses different triangle configurations (indi- 
cated in brackets in the first column), probing the range of scales 
listed in the first column. 



scale [/i -1 Mpc] 


61 


68 % C.I. 


C2 


68 % C.I. 


6-18 (6:12) 


1.01 


[0.91,1.15] 


-0.27 


[-0.37,-0.15] 


6-27 (6:12+9:18) 


0.93 


[0.85,1.03] 


-0.34 


[-0.42,-0.23] 


9-18 (9:9, a > 60) 


0.97 


[0.88,1.09] 


-0.31 


[-0.41,-0.24] 


9-36 (9:18+12:24) 


0.94 


[0.83,1.07] 


-0.36 


[-0.45,-0.23] 



which the covariance matrix is diagonal. This strong correla- 
tion between bins in measurements of Qd,(a) was originally 
pointed out by GS05. Further details of the application of 
the singular value decomposition to the measured values of 
Q 3 (a) and the estimation of the signal-to-noise (S /N) of the 
Q-eigenmodes can be found in GS05. 

From Fig-H it is clear that the -21 <M 1 ^ < -20 sample 
provides a high quality measurement of Qs(a) in the weakly 
non-linear regime. The characteristic V-shape dependence 
of Q3 on angle is readily apparent across a range of triangle 
scales. With such a high S/N of the Q-eigenmode decom- 
position, this volume limited sample is expected to provide 
strong constraints on the parameters B and C in Eq. |5] we 
recall that in the weakly non-linear regime, B ~ 61 and 
C ~ C2. The constraints on these parameters, shown in the 
bottom row of Fig. 01 are discussed below in section 14.1.31 



4-1.2 Q3(a) as function of luminosity 

Staying in the weakly non-linear regime, we now con- 
sider the dependence of Qzia) on galaxy luminosity. In 
Fig. [3] we present, for the triangle configuration ri.3 = 
2 ri2 = 16 h" 1 Mpc, the variation of Q 3 (a) with luminos- 
ity, as measured from volume limited catalogues defined by 
-21 <M£ T < -20 and -19 <Mjj ] < -18. On these weakly 
non- linear scales the best S/N, as indicated in the bottom 
panels of Fig. [3] again occurs for the —21 <M bj < —20 sam- 
ple, as expected from the analysis of GS05. 

Fig.[3]shows that the characteristic Qz(a) shape is seen 
for both bright and faint galaxies. Moreover, the V-shape 
is essentially the same in the two samples, within the mea- 
surement errors. Given the size of the errors, we are not yet 
able to detect any clear evidence of luminosity segregation 
in Qz(a) in the weakly non-linear regime. Unfortunately, we 
encounter the same limitation in section 14.31 when consid- 
ering large equilateral triangles. 

From the middle panel of Fig. [3J we conclude that 
our measurement of Qs is robust to sampling variance and 
volume effects. Reassuringly, the results obtained from the 
mock catalogues for Qs(a) for different volume limited sam- 
ples are, within the errors, equivalent as they should be by 
construction. The bottom panel of Fig. |H] which presents 
the constraints on B and C (Eq. |SJ , is discussed in the next 
section. 
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Figure 2. Q3 in the weakly nonlinear regime. The upper row shows measurements from the 2dFGRS —21 <Mjj j < —20 volume limited 
sample. Different columns show the results for different triangle sizes, as indicated by the legend. The squares show the mean value of Q3 
as a function of a and the error bars are derived from the scatter in the mock catalogues with the same magnitude limits. The thick solid 
curves in the upper two rows show the predictions for Q3 in the ACDM model. The thick dashed curves in these panels shows the effect 
of applying a transformation (Eq. |SJ to this prediction corresponding to B = 1 and C = —0.3. The thin solid lines in the middle row 
show the mean Q3 measured in individual mocks. In the bottom row, we show the constraints on B and C, derived from an eigenmode 
analysis. The four contours shown from outside in correspond to x 2 = 11,8,6.17,2.3 (ie 99.7%, 95.4% and 68.3% confidence interval for 
2 parameters) and \ 2 = 1 (ie 68.3% for one of the parameters). The cross point shows B = 1, C = for reference. 



4.1.3 Constraints on bi & c 2 from -21 <M£ T < -20 

The bottom panels of Figs. [5] and [3] present the likelihood 
contours for the bias model parameters B and C, as de- 
fined in Eq. |S| which relates Qi{a) for galaxies to Q™(a) 
for the dark matter. Remember that in the weakly non- 
linear regime, B — bi and C = C2, the conventional linear 
and non-linear biasing terms respectively. The cross-circle 
point in the bottom panels of Figs. [5] and |3| shows an unbi- 



ased ACDM prediction (i.e. 61 = 1 and c% = 0). For some 
configurations, such as the right bottom panel of Fig. |S| 
the likelihood contours are broad and the bias parameters 
are poorly constrained. In this particular case (i.e. the faint 
galaxy population), this is chiefly a result of the small vol- 
ume considered, telling us that these larger-scale triangle 
configurations do not sample enough different environments 
in this volume. 



8 Gaztanaga et al. 




50 100 150 50 100 150 

a a 




12 3 12 

B B 



Figure 3. Same as Fig. [5] but for different volume limited sam- 
ples: -21 <M{j < -20 (left) and -19 <M£ t < -18 (right). In all 
panels, we consider the case where ri3 = 2 na = 16 h~ 1 Mpc. 



By combining measurements on different weakly non- 
linear scales, we can improve the constraints on bi — C2 for 
the -21 <Mb T < -20 sample, as shown in Fig. Tabled 
lists the corresponding marginalized best-fitting values. In 
this regime, we do find a slight trend, with a decrease of the 
bias parameters with increasing scale, although the trend is 
not very significant and is within the quoted 1-a error. The 
strongest constraints on b\ — ci come from the 6—27 h~ x Mpc 
configuration (bottom right panel in Fig^J, with measured 
values of fei = 0.931^8 and c 2 = — 0-34±g;J|. 

Finally, we note that in all the panels of Fig. [I] the 
unbiased ACDM prediction (shown by the cross-circle) is 
strongly excluded by the data. For example, for ri3 = 
2 r i2 = 18 h' 1 Mpc, Ax 2 > 80 for 2 degrees of freedom 
(which implies a disagreement in excess of 9-<r). Despite the 
correlation between bi and C2, the significance of the detec- 
tion of bias, i.e. values away from bi — 1 and C2 = 0, is 
much larger than is apparent from just adding the errors 
in quadrature or using the values in Table [5] with a square 
errorbox. In fact, the measured values of bi and C2 seem to 
follow a degenerate line, illustrated in Fig. |3] that avoids 
bi — 1 and C2 = for all scales and luminosities. As the 
scale or the luminosity of the sample decreases, 6i crosses 
unity and C2 passes through zero (though not at the same 
time!), following bi ~ C2 + 1.2 and hence avoiding the unbi- 
ased prediction. 
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Figure 4. The combined constraints on B and C, in the weakly 
non-linear regime, using different triangle configurations and the 
joint covariance of Qs(a) for the 2dFGRS volume limited sample 
-21 <M£ < -20. Each panel uses different triangle configura- 
tions (as indicated by each legend). For isosceles triangles with 
ri2 = ri3 = 9 h~ 1 Mpc, we fit only for a > 60 degrees, to en- 
sure that only weakly non-linear scales (9 to 18 h~ 1 Mpc) are 
considered. 
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Figure 5. Qs{oi) in the non-linear regime for two different lu- 
minosities (increasing from bottom to top) and different scales 
(decreasing from left to right), as indicated in the legend in each 
panel. The biased model (dashed lines) has B = 1 and C = —0.3, 
whereas Q-j for ACDM is plotted with a solid line. 
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Figure 6. Constraints on the model parameters B and C, for the 
samples presented in Fig. 151 

4.2 Qz(ot) in the non-linear regime 

In this section, we consider triangle configurations 
which probe non-linear clustering scales corresponding to 
^ 6 /i -1 Mpc (see section ll.'il . We first discuss results for 
three different triangle configurations using the -19 <M£j< 

— 18 volume limited catalogue ( >I4.2,1I| . This is found to 
be the optimal sample with which to study the non-linear 
regime due to its relatively high galaxy number density in a 
volume that is large enough to account for small-scale cosmic 
variance. Thus, this sample is the one least affected by shot 
noise 1 . Then, in H4.2.2I we consider the variation of Q-z(a) 
with luminosity and colour on non-linear scales, followed by 
a study of the constraints on the model parameters B and 
C (Eq.EJ in 34~2~31 

4.2.1 Q 3 (q) for -19 <M| 3 1 J < -18 

The bottom row of Fig.EJshows Q 3 (a) for the -19 <M£ T < 

— 18 sample with three triangle configurations which probe 
non-linear scales. From left to right, we have ri3 = 2 ri2 
with ri2 = 6, 3, and 1.5 ft -1 Mpc, respectively. For these 
configurations, the characteristic U-shape of Qs{a) is clearly 
visible. The l-cr errors are found to increase on larger scales 
due to the reduced number of independent triangle config- 
urations that one can fit within the sample. The S/N for 
these triangle configurations are 8, 22, and 18 respectively, 
as indicated in the bottom row of Fig. [(J Such values are 
not exceptional, but nevertheless are sufficient to allow use- 
ful constraints on B and C to be determined. These model 
constraints are further discussed in section Pi. 2. 31 

1 The L* sample is actually statistically better than this fainter 
one, but suffers from a much larger systematic uncertainty due to 
the presence of the two superstructures (see section [4.4l . 



Table 3. The best-fitting Qg and slope of Eq. [5]for differ- 
ent configurations (equilateral and elongated), for different galaxy 
populations (All, Red and Blue) and dark matter (DM). 



Triangle 


Sample 


Ql 


oil 


x 2 


equilateral 


All 


0.70 ±0.02 


-0.15 ±0.04 


3.1 


equilateral 


Red 


0.74 ± 0.04 


-0.34 ±0.11 


1.2 


equilateral 


Blue 


0.62 ± 0.03 


-0.06 ±0.07 


4.3 


equilateral 


DM 


0.97 ±0.03 







elongated 


All 


0.99 ± 0.04 


-0.06 ±0.09 


1.7 


elongated 


Red 


1.07 ±0.02 


-0.37 ±0.03 


0.2 


elongated 


Blue 


0.96 ± 0.03 


0.08 ± 0.08 


1.2 


elongated 


DM 


1.46 ± 0.03 








4-2.2 Q3,{a) as function of luminosity and colour 

We use bright (-21 <M^< -20) and faint (-19 
— 18) volume limited samples to investigate the luminosity 
dependence of Q 3 (q) in the non- linear regime. This is pre- 
sented in Fig.H5](top and bottom panels) for different triangle 
configurations (from left to right). In Fig.^j] we show the cor- 
responding likelihood contours for the model parameters B 
and C from Eq.|S| Comparing the results between the faint 
and bright galaxy samples in Fig.F5] we see that, even though 
the characteristic U-shape is generally preserved, there is a 
weak tendency for the amplitude of Q3 (a) to decrease with 
increasing galaxy luminosity. In terms of the model param- 
eters B and C, Fig. |S| suggests that changing the character- 
istic luminosity of the galaxy population results in a shift in 
the best-fitting model. 

This general behaviour is better quantified in Fig. |7| 
which shows how Q3 changes with both colour and luminos- 
ity. The red and blue colour samples are sub-populations of 
each volume limited sample, split by rest frame bj — rp colour 
at 6j— tf = 1-07 (Cole et al. 2005). We focus our attention on 
two characteristic configurations: equilateral triangles, with 
q — 60 degrees, and elongated triangles with a ~ 180. We 
choose a common scale around ri2 — J~2 3 — 4 h~ 1 Mpc, 
where all samples yield a good detection of Q3. To im- 
prove the signal-to-noise, we take a large a bin, Aa = ±18 
degrees, and a large ri2 bin, Ari2 = ±1 /i -1 Mpc, com- 
pared to our standard choices of Aa = ±5 degrees and 
Ar w = ±0.6 h' 1 Mpc. 

The results in Fig.f7|are well fit by a linear relation that 
is a function of absolute magnitude: 

Qz = Ql + a L log 10 -L = Ql- 0.4 a L (M - M*) . (9) 

Table PJ1 shows the best- fitting values for the parameters Q3 
and oll using a x fit with the full covariance matrix. As 
shown by the \ 2 values in the table, the functional form 
given in Eq. provides a very good description of these 
measurements. As seen in Table fJl and also in Fig. |7| there 
is weak evidence for luminosity segregation in both the blue 
and red populations. Interestingly, the small but significant 
trend for the overall population can be fully attributed to 
the luminosity segregation in the red galaxies. 
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Figure 7. Values of Q3 for equilateral (upper panel) and elon- 
gated (lower panel) triangles, as a function of luminosity. Mea- 
surements with errorbars show different galaxy samples: all galax- 
ies (circles), red galaxies (triangles) and blue galaxies (squares). 
The solid, short-dashed and long-dashed lines show the corre- 
sponding best linear fit to Eq.[5] with the best-fitting values of the 
parameters quoted in Tabled] The dotted horizontal line shows 
the corresponding ACDM prediction. 



4-2.3 Constraints on B & C from non-linear scales 

Fig. [SJshows, as symbols with errorbars, the best-fitting val- 
ues for B and C (see Eq. |SJ in the non-linear regime us- 
ing all isosceles triangle configurations with ri2 — J~23 — 
4 Mpc, i.e. not just the elongated and equilateral cases 
considered in Fig. |7] We find weak systematic trends in 
the values of these parameters with luminosity, with fainter 
galaxies favouring larger values of B and C. All samples, 
except the brightest, are at least 1 or 2-<r away from the 
unbiased ACDM case (i.e. 5 = 1 and C = 0). Fig. |H1 also 
indicates, using lines, the corresponding values of B and C 
when we use the best linear fit to the Q3 data as a function 
of luminosity, as listed in Table |3] These lines emphasise the 
monotonic behaviour of the data and provide some addi- 
tional idea of the uncertainties. 

We note that if one took the naive interpretation of B 
as the linear bias parameter, 61, then the behaviour shown 
in Fig. [S] is exactly the opposite to that previously reported 
from the analysis of the 2-point correlation function, £2, over 
similar volume limited samples (Norberg et al. 2001, 2002a, 
2005). However, in our case here, the measurement of the pa- 
rameter B is done in the non-linear clustering regime, where 
B can no longer be interpreted as the linear bias parameter. 
This is why our notation explicitly differentiates between B 
and b\, and explains our choice to split the analysis into 
the two distinct clustering regimes. On small scales, we ex- 
pect strong corrections to the linear relation £2 — &i£:p M 
and similarly to Eq. |S] Indeed, the expansion used in Eq.Q 
is only valid in regimes where the density fluctuations are 
small with 8 < 1. Thus, the fact that B, as measured from 



Figure 8. Deviations of Q3 from the ACDM prediction as quan- 
tified by B and C (see Eq. [5] recall that, for the dark matter, 
B = 1 and C = 0, as shown by the horizontal dotted line in each 
panel). Results are shown for all configurations of triangles with 
7*12 ~ T23 ~ 4 h _1 Mpc, as a function of luminosity. Circles, tri- 
angles and squares with errorbars correspond to all, red and blue 
galaxies respectively. Continuous, short-dashed and long-dashed 
lines show the corresponding predictions for all, red and blue 
galaxies using the best-fitting parameters listed in Table and 
Eq.|5| 

Qs(a), decreases with increasing luminosity does not neces- 
sarily mean that £2/^2 should decrease for brighter galax- 
ies. Once we are in the non-linear regime, B and C can only 
be understood in terms of their effect on (33(a), ie. a shift 
and scale modification of the Q™(a) dark matter configura- 
tion, and not in terms of the theoretically motivated relation 
given by Eq. |7| Even with these interpretative restrictions, 
the values of B and C that we recover provide an accurate 
description of how biasing changes Q™(a) and so give inter- 
esting new constraints on models of galaxy formation. 

4.3 Qi as function of scale 

We present in Fig. [5] results for Q3 using equilateral and 
elongated triangle configurations for the —19 <Mjj j < —18 
and —21 <M£ J < —20 volume limited samples. These two 
configurations correspond to isosceles triangles with a = 60 
degrees and a = 180 degrees respectively. The equilateral 
configuration has the nice property that it can be fully 
characterized by just one triangle side length, so that each 
triangle samples a unique scale. Fig. |U] shows that for the 
equilateral configuration (upper panel), Q3 displays little 
scale dependence, as opposed to the elongated case (lower 
panel), which shows a rather strong scale dependence, much 
stronger than the dark matter prediction for similar triangle 
configurations. In addition, on non-linear scales, the equi- 
lateral configuration also provides clear evidence for lumi- 
nosity segregation, although on weakly non-linear scales the 
results are currently too uncertain to continue making such 
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Figure 9. Left panels: the upper and lower panels show, as a function of the triangle side length, Qs for equilateral and elongated 
configurations respectively. Squares and dots correspond to the —19 <Mg < —18 and —21 <Mj^< —20 galaxy samples respectively. 
For equilateral triangles, we detect a clear luminosity segregation on non-linear scales and no significant scale dependence for galaxies 
of a given luminosity. This is in contrast to elongated triangles, where a strong scale dependence is observed but with no indication of 
luminosity segregation on any scale. The dotted lines in each panel show the ACDM prediction for Q3(ri2). Right panels: Best-fitting B 
and C values for the combined configurations shown in the left panel, for the —21 <Mjj < —20 (dots) and —19 <Mj^< —18 (squares) 
samples. Note the change to a linear scale in the x-axis. 



a claim. This, unfortunately, mirrors our conclusions from 
section 14.1.21 using other triangle configurations. Elongated 
measurements of Q3, in contrast, display no luminosity de- 
pendence at all. 

From Fig. |3J we conclude that both triangle configu- 
rations, when probing weakly non-linear scales, are in rea- 
sonably good agreement with the ACDM prediction, favour- 
ing nevertheless a slightly negative value for C. This is in 
good agreement with the results presented in Table [5] On 
non-linear scales, the difference between the data and the 
dark matter prediction is striking and will provide a pow- 
erful constraints on models of galaxy formation. Note that 
the results presented in Fig.[5]are fully consistent with those 
discussed in §4.1.3. The aim of the exercise of considering 
isoceles triangles is to isolate the dependence of Q3 on scale. 
To achieve this, the binning of Q3 as a function of a is de- 
graded and the range of values of a considered on weakly 
non-linear scales is reduced when compared with the ear- 
lier, less restrictive analysis in §4.1.3 (which considered also 
triangles with ri3 > ri.2 and better a resolution). As a con- 
sequence, the resulting constraints on the parameters B and 
C presented in this subsection have larger errorbars. 

4.4 The influence of superstructures on Q3 

The measured Q3 for the L* sample, i.e. galaxies with ab- 
solute magnitudes in the range -20 <Mj 3 1 ,< -19, can be 
strongly influenced by the presence of two superstructures 
in the 2dFGRS, depending on the scale measured. The im- 
pact of these structures on the distribution of counts-in- 
cells in the L + sample was first pointed out by Baugh et al. 



(2004). Baugh et al. presented results both with and without 
the superstructures, to illustrate the systematic effect that 
their presence has upon the clustering measurements. For 
the L* sample, the superstructures were found to dominate 
the clustering statistics on scales larger than « 6 hT 1 Mpc, 
and we find that these structures have a comparable influ- 
ence in our analysis, as shown in Fig. llOl fin general the influ- 
ence of these structures can be seen out to the largest scales 
that can be probed). On smaller scales (ri3 < 6 h" 1 Mpc), 
and for the other volume limited samples that we consider, 
either the systematic contribution from these large coher- 
ent structures lies within the (correlated) errorbars from the 
mocks, or, because of their spatial location, the superstruc- 
tures are not present within the volume analysed. 

The impact of the superstructures is equally pro- 
nounced in both the NGP and SGP regions, as is clear from 
the left panel of Fig. H01 Both the NGP and SGP results for 
Q3(a) (solid and dashed lines respectively) change in a sim- 
ilar way when the superstructure in each region is removed. 
As we pointed out in Section|^J the volume masked out when 
the superstructures are excised is less than 2 % of the total, 
with a loss of approximately 5 % of the galaxies. As Baugh 
et al. remarked, the act of removing the superstructures is 
not intended serve as a correction to the clustering measure- 
ments, but rather as an illustration of the systematic effects 
that rare objects produce in higher order clustering statis- 
tics. That said, the results with the superclusters removed 
do appear to be more in line with our theoretical prejudice 
for the weakly nonlinear regime. 

The difference between the results obtained for Qa(a) 
with and without the superstructures is larger than the vari- 
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Figure 10. The impact of the superstructures on the measurement of Qs{a) for the —20 <Mjj < —19 sample and the r\z = 2 ri2 = 
12 h~ 1 Mpc triangle configuration. Open and filled circles show respectively the results with and without the superstructures (SC). All 
errorbars are derived from the mocks and correspond to a 2-cr limit. Left panel: solid (dashed) lines show results for the NGP (SGP) 
region. The upper pair of lines and set of symbols show the results with the SC; the lower set without SC. Right panel: solid (dashed) 
lines correspond to blue (red) galaxies. As in the left panel, upper pair of lines and set of symbols show the results with the SC; the 
lower set without SC. 



ance over our ensemble of 22 mock catalogues. This indicates 
that the volume limited samples in the 22 mocks do not con- 
tain the large, coherent structures seen in the real data, as 
mentioned above. As Baugh et al. (2004) commented, the 
presence of such structures could give us new insights into 
models of structure formation. However, it is important to 
bear in mind that the lack of superstructures in such a small 
number of mocks does not place a very high confidence limit 
against them being seen at all in the ACDM model. All that 
we can conclude from the non-detection of such objects in 
our ensemble of synthetic 2dFGRS catalogues is that they 
occur less than 5% of the time. We have carried out an 
analysis of a large ensemble of dark matter simulations, in 
addition to undertaking a more extensive search of the Hub- 
ble Volume simulation, in order to place tighter constraints 
on the frequency of such superstructures in the ACDM cos- 
mology. Before we can place firm limits on the chances of 
finding superstructures like those seen in the 2dFGRS, we 
need to make realistic mocks with the radial and angular se- 
lection of the 2dFGRS, but the preliminary indication from 
analyzing idealized, cubical volumes is that it is possible to 
find such superstructures in the simulations. Full details will 
be presented in a future paper (see also Fosalba, Pan & Sza- 
pudi 2005, who discuss the impact 10 15 solar masses halos 
on theoretical prediction of Q3). 

In the right panel of Fig. 1101 we show, for the same tri- 
angle configuration as in the left panel, the results for Qz^a) 
split by colour: red (solid) and blue (dashed). It is interesting 
to see that with the superstructures included, the difference 
between Q3 measured for red and blue galaxies is mildly 
significant (with red galaxies having a systematically larger 
Q3). In contrast, when the superclusters are excluded from 



the analysis, the measured Q3 for red and blue galaxies are 
identical to within the errors. This segregation suggests that 
the superstructures are, perhaps not unsurprisingly, popu- 
lated preferentially by red galaxies. 



5 COMPARISON WITH PREVIOUS 2dFGRS 
RESULTS 

As mentioned in the introduction, Baugh et al. (2004) and 
Croton et al. (2004) found the puzzling result that S3 ~ 2 
for 2dFGRS galaxies in contrast to the theoretical value of 
S^ M ~ 3 expected in ACDM on large (weakly non-linear) 
scales. This apparent inconsistencey can now be resolved 
using the bias parameters we have measured here, i.e. 61 ~ 1 
and c 2 ~ -0.3 gives S3 ~ (Sf M + 3c 2 )/6i ~ 2, in good 
agreement with the above mesurements on large scales. In 
their Fig. 10, Croton et al. (2004) found a weak dependence 
of S3 on galaxy luminosity, with a slope B3 ~ —0.4 detected 
with 2-cr confidence level. We note that this is in very good 
agreement with our nearly 4-cr detection of lvl — —0.15 ± 
0.04, as quoted in Table 3. Since S3 ~ 3Q3, we would expect 
B 3 ~ 3a L ^ -0.45 ± 0.12, as found in Croton et al. (2004), 
but with a higher significance. 

Measurements of three-point statistics from early 2dF- 
GRS data releases were made by Verde et al. (2002) and 
Jing & Borner (2004). These authors used compilations com- 
prising 127K and 100K galaxies respectively. Here, as re- 
marked earlier, we use the final dataset which contains dou- 
ble the number of galaxies and double the volume that were 
available for analysis in these preliminary studies. Recently, 
Pan & Szapudi (2005) have also analysed the final 2dFGRS 
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dataset, estimating the monopole moment of the three-point 
function, averaging over the shape dependence of triangles. 
In this section, we compare our results with those obtained 
by these authors and also with the measurement of the pro- 
jected Qz for the APM survey, the parent catalogue of the 
2dFGRS, made by Frieman & Gaztanaga (1999). 

Verde et al. (2002) found, using a three-point function 
analysis in Fourier space, that 2dFGRS galaxies are essen- 
tially unbiased tracers of the mass, recovering a linear bias 
factor consistent with unity, 61 = 1.04 ± 0.11, and a second 
order bias that is effectively zero, 62 = &1C2 = — 0.054±0.08. 
We note that Lahav et al. (2002) also reached a similar 
conclusion applying a different approach to the same 2dF- 
GRS dataset, arguing that 61 ~ 1. There is good agreement 
between our best value for bi and that obtained by Verde 
et al, which is encouraging in view of the possible reasons 
for discrepancies between the results of the two studies set 
out below. However, our results for the quadratic bias are 
quite different from those of Verde et al. Our optimum mea- 
surement gives a 3-<r detection of a non-zero value for the 
quadratic bias, whereas Verde et al. found a value consis- 
tent with zero. The discrepancy between our results and 
those of Verde et al. corresponds to A\ 2 > 80 for 2 degrees 
of freedom. This implies a 9-cr discrepancy (recalling that 
the errorboxes are not square, but elongated). The discrep- 
ancy in the claimed values of C2 is only 3-4 -a if we take the 
nominal errors on the measurement of ci by Verde et al. and 
assume square errorboxes. 

What are the reasons behind this disagreement? We 
have identified some aspects in which our analysis differs 
from that of Verde et al., which will contribute to the dis- 
crepancy at different levels, over and above the fact that 
we used different versions of the 2dFGRS data. First, we 
have considered the full 2dFGRS in configuration space, 
thus avoiding the need to compensate for the impact of the 
complicated 2dFGRS angular mask on measurements car- 
ried out in Fourier space. Verde et al. do not correct for 
the convolution of the underlying bispectrum with the an- 
gular survey window function, arguing that, for the range of 
wavenumbers they consider, this effect is unimportant. This 
conclusion is based upon tests carried out for the •power spec- 
trum by Percival et al. (2001). The impact of the window 
function on the bispectrum could be more extensive than 
in the case of the power spectrum, introducing anisotropies 
into the recovered bispectrum, and has not been tested ex- 
plicitly. Second, the range of galaxy luminosities considered 
is different in the two studies. We have analysed volume 
limited samples drawn from the 2dFGRS, whereas Verde 
et al. used the flux limited survey (however our best mea- 
surement comes from galaxies with luminosities between 1.3 
and 2.5 L*, and their sample corresponds to ~ 1.9 L*). 
Third, the scales used to constrain the parameters of the 
bias model are also different. We use triangles that probe 
pair separations from 9 to 36 ft -1 Mpc; Verde et al. con- 
sider 13 to 62 hT 1 Mpc, although most of their signal comes 
from the smaller scales, as shown by their fig. 4. Fourth, 
Verde et al. neglect the covariance between measurements 
of the bispectrum at different wavenumbers, which is a poor 
approximation even in Fourier space, as shown by Scocci- 
marro et al. (2001a) and Feldman et al (2001). Neglecting 
the covariance will artificially suppress the errors on fei and 
&2 by a considerable factor, corresponding roughly to the 



actual number of bins used divided by the number of domi- 
nant eigenmodes of the reduced three-point function, which 
in this case could be up to a factor of 4. This could to some 
extent explain why our relative errors are larger than those 
quoted by Verde et al., in spite of the more homogeneous 
2dFGRS dataset used in our analysis. Verde et al use mock 
catalogues to estimate the errors on the recovered values of 
61 and 62. The true, underlying value of 62 for the mocks 
is not known, so it is not possible to assess whether or not 
their method introduces any systematic biases in the recov- 
ered value of 62- A bias on &2 introduced by the convolution 
with the angular mask and the covariance in the bispectrum 
measurements could affect the estimated values of both the 
mean and the errors. In fact, the mocks used by Verde et al. 
are very similar to the ones used here as they were produced 
using the same prescription for galaxy biasing . As shown by 
the dashed lines in our Figs. 2, 3 and 5 there is a systematic 
shift of Q3 in the mocks with respect to the dark matter sim- 
ulations, indicating that &2 is in fact non zero (and negative) 
in the mocks, in contrast to fig. 2 of Verde et al. 

Our results are in somewhat better agreement with 
those of Jing & Borner (2004) and Wang et al. (2004), 
who analysed the 2dFGRS 100k release (Colless et al. 
2001). They found that Q 3 measured for the 2dFGRS is 
smaller than the ACDM predictions, particularly for galax- 
ies brighter than L*. This agrees with our result (compare 
the measurements for galaxies shown by symbols in the top 
rows of Figs. [5] and [3] with the dark matter predictions plot- 
ted using thick lines) and is also at odds with the Verde et al. 
result. Our results for equilateral configurations in FigQarc 
also in good agreement with fig. 10 in Wang et al. (2004). 
However, the comparison with these results is not straight- 
forward for a number of reasons: (i) The authors used less 
than half the data that we have analysed, (ii) They used 
a different parametrization and binning for their measure- 
ments of Q3. (iii) They neglected covariance between bins 
and used approximate bootstrap errors. Jing & Borner in- 
terpreted the lower values of Q3 that they found as a con- 
sequence of a larger linear bias, b\ ~ 1.5, in contrast to our 
conclusion that most of the bias comes from the quadratic 
term C2 — —0.3, with a linear bias consistent with unity. 
This difference has a dramatic consequence for the implied 
value of erg. For galaxies fainter than L*, Jing & Borner get 
unbiased results, which disagrees with our findings. This, 
however, could be a result of the smaller volume probed by 
Jing & Borner, which gives larger errors on their measure- 
ment. Jing & Borner also seem to find less configuration 
dependence for Q3, i.e. as function of the triangle shape 
specified by a. As pointed out in GS05, this could partly be 
due to the use of too large a bin in the a angle that param- 
eterises triangular shape in addition to the smaller volume 
used. 

Most recently, as this paper was about to be submit- 
ted, Pan & Szapudi (2005) presented new results on the 
monopole moment of the three-point function measured 
from the full 2dFGRS. They find 61 ~ 1.04±g;g| and b 2 ~ 
-0.061 

o!oi- Both the technique and assumptions employed 
by Pan & Szapudi are conceptually very different from ours. 
The monopole contribution to the normalized three point 
function merely yields a constant value that is independent 
of triangle opening angle. It is approximately equivalent to 
the first eigenmode in our Singular Value Decomposition 
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of the covariance matrix of Qz, and therefore contains much 
less information than we use to place constraints on the bias 
parameters. As a consequence, the monopole alone cannot 
be used to separate 61 from 62; only the higher multipoles of 
Qz, can break this degeneracy. Pan & Szapudi instead use a 
simultaneous fit to the amplitudes of the two (£) and three- 
point (£) functions (as a function of scale) to place separate 
constraints on the values of b\ and 62 ; recall that our analysis 
only requires a fit to the ratio Q3 ~ C/£ 2 - Both the mod- 
elling and the systematics involved in the fit used by Pan & 
Szapudi are therefore quite different from ours. Our analy- 
sis is less sensitive to possible systematics in the amplitude 
of £. In particular, we do not need to model the impact of 
redshift distortions on the amplitudes of the 2 and 3-point 
functions as Pan & Szapudi must. Another important differ- 
ence is the implicit assumption used by Pan & Szapudi that 
the biasing parameters, bi, are constant over the whole range 
of scales considered, i.e. from 4 — 60 /i -1 Mpc. In our case, 
we allow bi to change for each combination of fixed scales ri2 
and ri3. Given these differences, there is surprisingly good 
agreement in the values obtained for 61 by the two methods. 
However, their 62 value is significantly different. This is not 
unexpected given the systematic uncertainties in modelling 
redshift distortions through the /f term in eqn. [6] of their 
paper. As shown in the right-hand panel of our Fig. Eland 
in Table [5] we find a weak trend for the bias parameters to 
increase as the triangle scale is reduced. This could also help 
to explain the slightly larger biasing parameters they find. 
Perhaps more puzzling is their fig. 7, which shows how 61 
increases for the brightest galaxies, in contrast to our Fig. 7 
(which only applies to the smallest scales considered by Pan 
& Szapudi). 

Our findings are compatible with the values of the pro- 
jected Q3 measured in the APM survey, which is the par- 
ent catalogue of the 2dFGRS. Frieman & Gaztanaga (1999) 
also found values of Q3 that lay below the ACDM predic- 
tions. However, they did not perform a proper S/N analysis 
with the covariance matrix to separate 61 from 62 ■ 2 Our re- 
sults are also in qualitative agreement with the values of the 
angular skewness, S3, measured in the APM galaxy survey 
(Gaztanaga 1994). The mean over large angular scales (cor- 
responding to 7—30 hT x Mpc) was estimated to be S3 = 3.8, 
with an error (dominated by sampling covariance) of the or- 
der ^ 10 %. The theoretical predictions for the projected 
moments using perturbation theory (Bernardeau 1995) and 
dark matter simulations (Gaztanaga & Bernardeau 1998) 
yield a mean value S3 ~ 5 over the same scales 3 . The skew- 
ness measured from the APM Survey is thus also below the 
the ACDM prediction. This agrees well with our estimates 
of the bias parameters fei ~ 1 and C2 — —0.3, which give 
s5f ~ (S3 + 3c2)/6i — 4, in excellent agreement with the ob- 

2 We note that such an analysis was, however, presented for Q3 
measured in Fourier space from the IRAS Point Source Redshift 
Catalogue (PSCz, Saunders et al. 2000) by Scoccimarro et al. 
(2001a) and Feldman et al. (2001). 

3 Note that this prediction differs from the hierarchical projec- 
tion for the same model/scales estimated by Gaztanaga (1994), 
which were closer to S3 ~ 4. This was first noted by Bernardeau 
(1995) and later confirmed with simulations by Gaztanaga & 
Bernardeau (1998). See also comments relating to figs. 47 and 
54 in Bernardeau et al. (2002) for further details. 



served APM values. Note that the APM results correspond 
to configuration space, in contrast to our results which are 
in redshift space. Thus, our simple quadratic bias model 
(in redshift space) can account simultaneously for observa- 
tions of real-space (projected) and redshift-space results for 
3-point statistics (both skewness and 3-point function) . Our 
new result solves the long standing observational puzzle re- 
garding how the measured and predicted values of S3 and 
Q3 can be reconciled. 



6 CONCLUSIONS 

We have measured the reduced 3-point function 
Qs(ri, T-2, r-i) ~ C/£ 2 (as defined in Eq. in the final 
2dFGRS catalogue, using triangles of different scales and 
opening angles. We have utilized a range of volume limited 
samples in our analysis, which allows us to look for cluster- 
ing trends as a function of galaxy luminosity. The inclusion 
of rF-band photometry in the final 2dFGRS data release 
also allows us to look for a dependence of the three point 
function on galaxy colour. Another novel aspect of our 
analysis is that we employ an eigenmode decomposition to 
deal with correlations between data-points and to assess the 
signal-to- noise of our measurements; our results typically 
have a signal-to-noise > 20. 

There are two primary motivations for measuring the 
reduced three point function. The first is to test the gravita- 
tional instability paradigm for the formation of large-scale 
structure in the Universe. There are clear predictions for 
the form of the three-point function in the case of an ini- 
tially Gaussian distribution of density fluctuations that have 
evolved under gravity (see Bernardeau et al. 2002) . The sec- 
ond motivation is to provide new constraints on models of 
galaxy formation, by establishing how the three-point func- 
tion of galaxies differs from that of the underlying dark mat- 
ter. It turns out that the predictions for the dark matter are 
insensitive to the amplitude of density fluctuations and to 
the detailed shape of the power spectrum. 

We have divided our analysis into two clustering 
regimes: weakly non-linear clustering (i.e. r Si 6/i -1 Mpc or 
£ 1) and non-linear clustering (r H 6h~ 1 Mpc or £ ^ 1). 
On weakly non-linear scales, there is a striking similarity 
between the shape of Q3 measured for galaxies and the pre- 
dictions for the dark matter. This supports the idea that the 
basic phenomenon behind the clustering pattern of galax- 
ies is gravitational instability, which confirms our previous 
conclusions reached from the analysis of the distribution of 
counts-in-cells for the 2dFGRS (Baugh et al. 2004; Croton 
et al. 2004b). 

There are, however, significant differences, between Q3 
measured for galaxies and the expectations for a ACDM uni- 
verse. We have modelled this discrepancy in terms of a shift 
and a scaling applied to the dark matter predictions. For 
scales on which the fluctuations are weakly nonlinear, the 
scaling can be identified with the linear bias, 61 and the off- 
set with the quadratic bias, 62 jb\. Our best measurement 
of these bias parameters gives a linear bias consistent with 
unity, but a significant detection of a non-zero quadratic 
bias, &2/&1 = — 0.34lQ Qg. This is the first time that the sig- 
nature of a quadratic bias has been seen so convincingly; our 
measurements are 9-cr away from the case in which galaxies 
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faithfully trace the mass (61 = 1 and &2 =0). Our results 
disagree with some of the previous analyses of the three 
point function in the 2dFGRS; a detailed discussion of the 
possible reasons for this is given in Section 5. We note that 
Feldman et al. (2001) also found a negative quadratic bias 
term when analysing the three point function of galaxies in 
the IRAS Point Source Catalogue redshift survey, albeit at 
a less significant level than our detection. 

The discrepancy between Qz for galaxies and the dark 
matter increases as the scale of the triangles is reduced 
(while remaining in the weakly non-linear regime), which 
translates into a slight increase in the best-fitting values of 
the bias parameters (see Table We find no significance 
evidence for luminosity segregation on these weakly non- 
linear scales. 

On smaller scales we are able to detect a significant de- 
pendence of Q3 on scale, color and luminosity. These trends 
appear at first sight to be at odds with the preliminary re- 
sults obtained by Kayo et al. (2004) using the SDSS, al- 
though the errors on the measurements presented by these 
authors are much larger than ours. In all cases, the measure- 
ments for the various samples of galaxies are clearly below 
the predictions for the dark matter. Our detailed measure- 
ments, presented in Fig. 6-8 and Table 3, should provide im- 
portant new constraints on models of galaxy formation (see 
Scoccimarro et al. 2001b; Wang et al. 2004). 

Our strong detection of a quadratic bias offers a new ex- 
planation of the long standing puzzle of why redshift surveys 
have tended to measure a different skewness (S3 ~ = 
2; see Croton et al. 2004b for the 2dFGRS and Table 19 
of Bernardeau et al. 2002 for a summary of other observa- 
tional results) from that predicted for the ACDM cosmology 
(S3 ~ 3). If we take the non- linear bias relation derived by 
Fry & Gaztanaga (1993), S3 = (S3 + 3c2)/&i and insert our 
best-fitting values for the bias parameters (61 = 0.95 and 
C2 = —0.35), then we obtain S3 G ~ 2, just as required by 
the observations. 

The value of Q3 is independent of the overall amplitude 
of fluctuations. This means that our measurement of the lin- 
ear bias, 61, is fully independent of the normalization of the 
fluctuations in the dark matter, as specified by erg. Further- 
more, the predictions for Q3 for dark matter are relatively 
insensitive to the shape of the power spectrum, making this 
estimate of the bias robust to minor changes in the param- 
eters of the ACDM model. We can therefore combine our 
estimate of 61 with the amplitude of fluctuations measured 
from the galaxy distribution, erg, to derive an estimate of 
the amplitude of fluctuations in the dark matter, erg. Cole 
et al. (2005) measured the power spectrum of galaxy clus- 
tering in the 2dFGRS and found erf ~ 0.924 ± 0.032. The 
equation relating fluctuations in the galaxies to those in the 
dark matter involves two other terms: 

a% = 61 D(z) K(f3) ag . (10) 

Here D(z) ~ 0.95 is the growth factor at the mean depth of 
the survey (z ~ 0.1) relative to the growth factor at z — 
and K is the linear Kaiser (1987) redshift space distortion 
factor: K ~ 1.17 for j3 — 0.48. Both factors depend on the 
cosmological density parameters for matter and vacuum en- 
ergy, which we have set to their concordance model values 
(f2 m ~ 0.3, Q,a — 0.7 and h ~ 0.7). This allows us to esti- 
mate erg: 



as ~ 0.88ig;S • (11) 

Here we have assumed that the errors are dominated by 
the errors in bi. This explains the good agreement found 
between the large scale variance in 2dFGRS galaxies and 
the variance of the dark matter for erg ~ 0.9, as shown in 
fig. 2 of Baugh et al. (2004). A more detailed presentation 
of our result for erg will be deferred to a later paper. 

Note added on submission: On the day before our 
paper was submitted, Hikage et al. (2005, astro-ph/0506194 ) 
posted a paper on the three-point function of SDSS galax- 
ies, in which they perform a Fourier Phase analysis. Their 
main result is that &2/&1 ~ if erg = 0.9, in apparent con- 
tradiction with our principal finding. However, Hikage et al. 
consider scales in excess of 30/i -1 Mpc and restrict their at- 
tention to triangles with large opening angles. Their analysis 
is therefore similar to the special case we present in Fig. 9 
for elongated and equilateral triangles. As we explained in 
§4.3, in this case, due to the reduced number of triangles 
considered, the errors on the bias parameters are large. As 
shown in the upper right panel of our Fig. 2, the error bars 
become quite large for a ~ 180 on large scales. Furthermore, 
there is actually no reason to expect the bias parameters ex- 
tracted by Hikage et al. to agree closely with ours, as SDSS 
galaxies are red selected. 
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